深度学习中常用的图像数据增强方法-纯干货

Original gloomyfish OpenCV学堂 2020-02-04

微信公众号：OpenCV学堂
关注获取更多计算机视觉与深度学习知识;
觉得文章对你有用，请戳底部广告支持

图像数据增强方法概述

图像数据准备对神经网络与卷积神经网络模型训练有重要影响，当样本空间不够或者样本数量不足的时候会严重影响训练或者导致训练出来的模型泛化程度不够，识别率与准确率不高！本文将会带你学会如何对已有的图像数据进行数据增强，获取样本的多样性与数据的多样性从而为训练模型打下良好基础。通读全文你将get到如何几个技能：

使用标准化对图像进行图像增强
使用几何变换（平移、翻转、旋转）对图像进行数据增强
使用随机调整亮度对图像进行增强
使用随机调整对比度对图像进行增强

演示基于mnist数据集，使用tensorflow+opencv，随机获取9张28x28的大小的数据图像，然后进行处理，处理之后通过opencv来显示结果。加载mnisnt数据集，获取随机9张图像，显示的代码如下：

from tensorflow.examples.tutorials.mnist import input_data
import tensorflow as tf
import numpy as np
import cv2 as cv
mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)
batch_xs, batch_ys = mnist.train.next_batch(9)


def show_images(images_data, win_name):
    plot_image = np.zeros(shape=[96, 96], dtype=np.float32)
    for i in range(0, 9):
        col = i % 3
        row = i // 3
        plot_image[row*28:row*28+28, col*28:col*28+28] = images_data[i].reshape(28, 28)

    # show the plot
    cv.imshow(win_name, cv.resize(plot_image, (256, 256)))


batch_xs = batch_xs.reshape(batch_xs.shape[0], 1, 28, 28)
show_images(batch_xs, "batches")
sess = tf.Session()
print(batch_xs.shape)

选择9张mnist图像

图像标准化

关于图像标准化的原理，可以看本公众号以前的文章即可，点击如下链接即可查看：

深度学习训练-详解图像数据标准化与归一化

标准化的图像增强代码如下：

def standardization():
    results = np.copy(batch_xs)
    for i in range(9):
        image = sess.run(tf.image.per_image_standardization(batch_xs[i].reshape(28, 28, -1)))
        results[i, :, :, :] = image.reshape(-1, 28,28)
    show_images(results, "standardization")

标准化增强如下

翻转、旋转

图像几何变换通常包括图像的平移、翻转、旋转等操作，利用图像几何操作实现图像数据增强。
翻转操作代码如下：

def random_flip():
    copy = np.copy(batch_xs)
    copy = np.squeeze(copy, axis=1)
    copy = np.expand_dims(copy, axis=3)
    flip_results = sess.run(tf.image.flip_left_right(copy))
    flip_results = np.squeeze(flip_results, axis=3)
    flip_results = np.expand_dims(flip_results, axis=1)
    print(flip_results.shape)
    show_images(flip_results, "flip_left_right")

翻转增强之后显示

旋转操作代码如下：

def random_rotate():
    results = np.copy(batch_xs)
    for i in range(9):
        image = sess.run(tf.image.rot90(batch_xs[i].reshape(28, 28, -1), i%4+1))
        results[i, :, :, :] = image.reshape(-1, 28,28)
    show_images(results, "random_rotate")

随机90度旋转操作增强之后

随机亮度

随机亮度通过调整图像像素值改变图像亮度，这种方式对图像进行数据增强的代码如下：

def random_brightness():
    results = np.copy(batch_xs)
    for i in range(9):
        image = sess.run(tf.image.random_brightness(batch_xs[i].reshape(28, 28), 0.9))
        results[i, :, :, :] = image.reshape(-1, 28,28)
    show_images(results,"random_brightness")

随机亮度增强之后显示

随机对比度

随机对比度，通过调整图像对比度来对图像进行数据增强，代码实现如下：

def random_contrast():
    results = np.copy(batch_xs)
    for i in range(9):
        image = sess.run(tf.image.random_contrast(batch_xs[i].reshape(28, 28, -1), 0.85, 1.5))
        results[i, :, :, :] = image.reshape(-1, 28,28)
    show_images(results, "random_contrast")

随机对比度增强之后显示

python运行调用

random_flip()
random_brightness()
random_contrast()
random_rotate()
standardization()
cv.waitKey(0)
cv.destroyAllWindows()

【推荐阅读】

OpenCV Gabor滤波器实现纹理提取与缺陷分析

OpenCV中如何获得物体的主要方向

tensorflow中实现神经网络训练手写数字数据集mnist

新课程发布 - 《tensorflow零基础入门视频教程》

tensorflow中实现神经网络训练手写数字数据集mnist

Windows系统如何安装Tensorflow Object Detection API

使用Tensorflow Object Detection API实现对象检测

寇可为，我复亦为寇可往，我复亦往

插播一条广告，欢迎加入【OpenCV研习社】体系化学习计算机视觉，掌握OpenCV+tensorflow编程技术，扫描下面二维码即可加入，给自己未来加油！

观察｜官方通报陕西蒲城一职校学生坠亡：事发前与舍友发生口角和肢体冲突认定该生系高空坠落死亡

桐城一派｜倒在“跨年夜”的龚书记，13个字换来免职调查冤不冤？

比佟丽娅还恋爱脑，怀孕7次流产4次，目睹丈夫背叛却选择原谅

市管干部“龚书记”免职迷局

讣告！又一知名女星在家中去世，终年54岁，曾是无数人白月光…